A Novel Method for Wavelet Quantization of Noisy Speech
نویسندگان
چکیده
This paper proposes an architecture for low bit rate coding of noisy speech. The input noisy speech is decomposed into multiresolution signal components using wavelet transform. An iterative Wiener filtering is used at each level of wavelet analysis to enhance speech. The system model that evolves during enhancement is processed further to get optimal parameters for the quantization. A multistage vector quantizer is used for compression of decomposed speech. The enhanced speech is reconstructed at the receiving end by a VQ decoder and the necessary wavelet reconstruction network. Speech coding rate for the proposed architecture is estimated to be about 2.37Kbps.
منابع مشابه
Wavelet Quantization of Noisy Speech Using Constrainedwiener
In this paper we propose an architecture for low bit rate coding of noisy speech. The input noisy speech is decomposed into multi-resolution signal components using wavelet transform. An iterative Wiener lter-ing is used at each level of wavelet analysis to enhance speech. The system model that evolves during enhancement is processed further to get optimal parameters for the quantization. A mul...
متن کاملA New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain
Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...
متن کاملCompact Speech Features Based on Wavelet Transform and Pca with Application to Speaker Identification
The main goal of this paper is to find some effective methods to improve the performance of speaker identification system. In speaker identification, we use wavelet transform to decompose the speech signals into several frequency bands and then use cepstral coefficients to capture the individualities of vocal track within the interested bands based on the acoustic characteristic of human ear. I...
متن کاملIntegrated speech enhancement and coding in the time-frequency domain
This paper addresses the problem of merging speech enhancement and coding in the context of an auditory modeling. The noisy signal is rst processed by a fast wavelet packet transform algorithm to obtain an auditory spectrum, from which a rough masking model is estimated. Then, this model is used to re ne a subtractive-type enhancement algorithm. The enhanced speech coe cients are then encoded i...
متن کاملRobust Speaker Identification System Based on Two-Stage Vector Quantization
This paper presents an effective method for speaker identification system. Based on the wavelet transform, the input speech signal is decomposed into several frequency bands, and then the linear predictive cepstral coefficients (LPCC) of each band are calculated. Furthermore, the cepstral mean normalization technique is applied to all computed features in order to provide similar parameter stat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000